Temporal effects in the growth of networks 
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We show that to explain the growth of the citation network by preferential attachment (PA), one 
has to accept that individual nodes exhibit heterogeneous fitness values that decay with time. While 
previous PA-based models assumed either heterogeneity or decay in isolation, we propose a simple 
analytically treatable model that combines these two factors. Depending on the input assumptions, 
the resulting degree distribution shows an exponential, log-normal or power-law decay, which makes 
the model an apt candidate for modeling a wide range of real systems. 



Over the years, models with preferential attachment 
(PA) were independently proposed to explain the distri- 
bution of the number of species in a genus |l[ , the power- 
law distribution of the number of citations received by 
scientific papers Q and the number of links pointing to 
WWW pages [4j. A theoretical description of this class 
of processes and the observation that they generally lead 
to power-law distributions are due to Simon 0. Notably, 
the application of PA to WWW data by Barabasi and Al- 
bert helped to initiate the lively field of complex networks 
Their network model, which stands at the center of 
attention of this work, was much studied and general- 
ized to include effects such as presence of purely random 
connections , non- linear dependence on the degree Q , 
node fitness |8j and others d Ch. 8]. 

Despite its success in providing a common roof for 
many theoretical models and empirical data sets, pref- 
erential attachment is still little developed to take into 
account the temporal effects of network growth. For ex- 
ample, it predicts a strong relation between a node's age 
and its degree. While such first-mover advantage [10( 
plays a fundamental role for the emergence of scale free 
topologies in the model, it is a rather unrealistic feature 
for several real systems (e.g., it is entirely absent in the 
WWW [111 and significant deviations are found in cita- 
tion data [lfj, |l2|). This motivates us to study a model 



of a growing network where a broad degree distribution 
does not result from strong time bias in the system. To 
this end we assign fitness to each node and assume that 
this fitness decays with time — we refer it as relevance 
henceforth. Instead of simply classifying the vertices as 
active or inactive, as done in [l3|, 14 1 , we use real data to 
investigate the relevance distribution and decay therein 
and build a model where decaying and heterogeneous rel- 
evance are combined. 

Models with decaying fitness values ("aging") were 
shown to produce narrow degree distributions (except for 
very slow decay) [IH and widely distributed fitness val- 
ues were shown to produce extremely broad distributions 
or even a condensation phenomenon where a single node 
attracts a macroscopic fraction of all links 16]. We show 
that when these two effects act together, they produce 
various classes of behavior, many of which are compati- 
ble with structures observed in real data sets. 



Before specifying a model and attempting to solve it, 
we turn to data to provide support for our hypothesis 
of decaying relevance. We use here the citation data 
provided by the American Physical Society (APS) which 
contains all 450084 papers published by the APS from 
1893 to 2009 together with their 4 691938 citations of 
other papers from APS journals. It is particularly fit- 
ting to use citation data for our work because ordinary 
PA with direct proportionality to the node degree was 
detected in this case by previous works [13, E3- Data 
analysis according to [1 81 ] reveals that the best power-law 
fit to the in-degree data has lower bound fc m i n = 50 and 
exponent 2.79 ± 0.01. Though p- values greater than 0.10 
are only achieved for fc m j n > 150, log-normal distribution 
does not appear to fit the data particularly better. Since 
PA can be best imagined to model citations within one 
field of research, we consider in our analysis also a sub- 
set of papers about the theory of networks. We identify 
them using the PACS number 89. 75. He ("Networks and 
genealogical trees" ) — in this way we obtain a small data 
set with 985 papers and 4 395 citations among them. 

Denoting the in-degree of paper i at time t as ki(t) 
and assuming that during next At days, C(t, At) new ci- 
tations are added to papers in the network, preferential 
attachment predicts that the number of citations received 
by paper i is Ah(t,At) PA = C(t, At)ki(t)/Y,j kj(t). If 
in reality, Aki(t, At) citations are received, the ratio be- 
tween this number and the expected number of received 
citations defines the paper's relevance 



X t (t,At) := 



Ah(t,At) Zjkjjt) 
C{t,At)h{t) ' 



(1) 



This expression is obviously undefined for fcj (t) = which 
stems from the known limitation of the PA in requiring 
an additional attractiveness factor to allow new papers 
to gain their first citation. Although one could try to 
include this effect in our analysis, we simply compute 
Xi(t, At) only when ki(t) > 1. Similarly we exclude time 
periods when no citations are given and C(t, At) = 0. 

Figure [1] shows how the average relevance of papers 
with different final in-degree values decays with time af- 
ter their publication. We see that the relevance values 
indeed decay and this decay is initially very fast (for pa- 
pers with the highest final in-degree, it is by a factor of 
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years after publication 

FIG. 1. Time decay of the average relevance values (based 
on At = 91 days) for papers divided into groups according 
to their final in-degree (color online). The dashed line shows 
X = 1 indicating exact preferential attachment and open cir- 
cles show the initial relevance values. The inset shows results 
for papers about the theory of networks. 
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FIG. 2. The distribution of the total relevance Xt in the 
studied data (color online). For Xt > 25 • 10 3 , f(Xr) decays 
as exp[-aX T ] with a = (21.7 ±0.2) ■ 10~ 3 (denoted with the 
indicative dashed line). The peak at Xt — is due to ap- 
proximately 60 000 papers without citations. The inset shows 
results for papers about the theory of networks. 



100 in less than three years). However, the exponential 
decay reported in [1 91 ] appears to have only very limited 
validity (up to five years after the publication date). Af- 
ter 10 or more years, the decay becomes very slow or 
even vanishes, producing a stationary relevance value tq. 
Figure [2] depicts the distribution of the total relevance 
Xx{i) = Y] t Xj(t) and shows that, perhaps contrary to 
one's expectations, this distribution is rather narrow with 
an exponential decay for Xt > 25 • 10 3 . An exponential- 
like tail appears also when the analysis is restricted to 
papers of a similar age which means that it is not only 
an artifact of the papers' age distribution. One could 
attempt to fit this data with, for example, a Weibull dis- 
tribution as in [2l| . We shall see later that it is the tail 
behavior of Xt what determines the tail behavior of the 
degree distribution, hence the current level of detail suf- 
fices our purpose. We can conclude that in the studied 
citation data, relevance values exhibit time decay and 
papers' total relevances are rather homogeneously dis- 
tributed, showing an exponential decay in the tail. 

Now we proceed to a model based on the above- 
reported empirical observations. We consider a uniformly 
growing undirected network which initially consists of 
two connected nodes. At time t, a new node is introduced 
and linked to an existing node i where the probability of 
choosing node i is 



P(i,t) 



ki(t)Rj(t) 



(2) 
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which has the same structure as assumed before in 
[l9j . Here kj (t) and Rj (t) is degree and relevance of node 
j at time t, respectively [20j]. Our goal is to determine 
whether a stationary degree distribution exists and find 
its functional form. 

Eq. ^ represents a complicated system where evolu- 
tion of each node's degree depends not only on the node 



itself but also on the current degrees and relevances of 
all other nodes. The key simplification is based on the 
assumption that at any time moment (except for a short 
initial period) , there are many nodes with non- negligible 
values of ki(t)Ri(t). The denominator of Eq. © is then a 
sum over many contributing terms and therefore it fluc- 
tuates little with time. This allows us to approximate 
the exact selection probability P(i,t) with 



P(i,t) 



ki(t)Rj(t) 



(3) 



where f2(i) is now just a normalization factor. 

If Ri(t) decays sufficiently fast (faster than 1/t) and 
Ymvt^oo Ri{t) — 0, the initial growth of Sl(t) stabilizes 
at a certain value f2* which shall be determined later by 
the requirement of self-consistency. The master equa- 
tion for the degree distribution p(ki, t) now has the form 

p(k u t+l) = (l-kiRi(t)/il*)p(ki,t) + (ki-l)Ri(t)p(ki- 

l,i)/Q*. Note that the stationarity of p(ki,t) in our 
case is due to transition probabilities that vanish be- 
cause lim f ^oo Ri(t) — 0. Before tackling the degree dis- 
tribution itself, we examine the expected final degree of 
node i, (kf). By multiplying the master equation with k% 
and summing it over all k iy we obtain a difference equa- 
tion (ki(t + 1)) = (&;(*)) (1 + R l (t)/n*). If Ri(t) decays 
sufficiently slowly, we can switch to continuous time to 
obtain d{ki(t)}/dt = ki(t)Ri(t)/Cl* which together with 
(ki{ti)) = 1 yields 



{kf 



exp 



(4) 



Here U is the time when node i is introduced to the 
system (in our case, ti — i). When the continuum ap- 
proximation is valid, this result is well confirmed by nu- 
merical simulations (see the inset in Fig. [3]). To observe 
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saturation of the degree growth for an infinitely grow- 
ing network, the total relevance Tj := J. Ri(t)dt must 
be finite and hence Ri(t) must decay faster than 1/t 
for all nodes. To assess the error of the continuum ap- 
proximation, one can use the Taylor expansion to write 
(ki(t + 1)> - (ki(t)) « d(ki(t))/dt + \& 2 {h{t))/dt 2 . The 
second derivative term can be approximately evaluated 
using Eq. Q and it can be shown that it's negligible 
when <C Ri(t), which is consistent with our initial 

assumption that Ri{t) decays sufficiently slowly for all i. 

Since tt* is the same for all nodes, Eq. Q demonstrates 
that a node's expected final degree depends only on its 
total relevance Tj. Therefore we can use the continuum 
approach to compute fi* directly from its definition as 
n* = J g(T)(Q(T)) dT where (ft(T)) a lim^ /„* R(t - 
t )(k(t - t )) dt = f™R(t){k(t))dt = Q*(e T / n * - 1), 
as there is only one node with total relevance T which 
contributes to (0(T)) with R(t){k(t)) for each t. When 
g(T) is given, the resulting equation 



J g(T) e T/n 'dT = 2 



(5) 



can be used to find f2*. Alternatively, the construction 
constraint of the average network's degree in the large 
time limit, (k) = 2, implies J °° g(T)(k F (T)) dT = 2 
which gives the same equation for Note that when 
g(T) decays slower than exponentially, the integral in 
Eq. ([5]) diverges and no Cl* can satisfy the system's re- 
quirements, implying that in this case no stationary value 
of f2* is established. 

Similarly to degree fluctuations for nodes of 

a given total relevance can be derived from the master 
equation. When |i?j(i)| <C Ri(t), the continuum approx- 
imation can be again shown to be valid and yields 



d{kf)/dt = R l (t)((k t (t)) + 2(kf(t)))/n* 



(6) 



where (kf(0)) = 1 and which can be solved for general 
Ri {t) to obtain the stationary standard deviation of the 
node's degree 



<Jk(Ti) 



3 2Ti/n* _ T</rr\V 2 



(7) 



When Tj = T for all nodes, Eq. © implies e T / n " = 2 
and therefore 07. — v2. We see that the resulting degree 
distribution f(k) is very narrow which is not the case 
in most real complex networks. One has to proceed to 
heterogeneous Tj values. 

Since the distribution f(ki\Ti) is very narrow, one 
can use the distribution g(T) together with Eq. ^ and 
f(k) dk = g(T) dT to obtain the degree distribution f(k). 
If Ti are drawn from a distribution with finite support, 
the support of f(k) is also finite which is not of interest 
for us (though it may be appropriate to model some sys- 
tems). If Ti follow a truncated normal distribution (the 
truncation is needed to ensure Tj > and (fej) > 1), it 




FIG. 3. Simulation results for the studied model where 
Ri(t) = Ri(0) e - ^ t-t *' (ti is the time when node i entered 
the network), Ri(0) values are drawn from an exponential 
distribution and the final number of nodes is 10 5 (color on- 
line). Since the decay is the same for all nodes, distributions 
of Ri(0) and Ti have the same functional form. The indicative 
dashed line has the slope of —3. The inset shows the depen- 
dency between /3T and the average node degree; the dashed 
line follows from Eq. (|4]). 



follows immediately that f(k) is log-normally distributed 
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which may be of great relevance in many cases 
We finally consider T values that follow a fast-decaying 
exponential distribution g(T) = acxp[— oT] which is 
supported by the analysis of citation data presented in 
Figure [5] By transforming from g(T) to f(k), we then 
obtain f(k) - k-^ an * . From Eq. © it follows that 
in this case is fl* — 2/a, hence the power-law exponent 
is 7 = 3. We see that even a very constrained expo- 
nential distribution of T leads to a scale-free distribution 
of node degree — the exponent of this distribution is in 
fact the same as in the original PA model. As shown 
in Fig. [31 numerical simulations confirm that this result 
truly realizes in a wide range of parameters. 

Motivated by Fig. El one may ask what happens when 
T is exponentially distributed only in its tail. We take a 
simple combination where 1 — q of all nodes have T = 1 
and the remaining nodes follow the exponential distri- 
bution g(T) = e-^" 1 ) for T € [l;oo). By the same 
approach as before, we obtain the equation for Q* in the 
form e 1 ^* [1 -q+q/(l - I/O,*)] = 2 which yields power- 
law exponents monotonically increasing from 2.44 (for 
q = 0) to 4.18 (for q = 1). The reason for the exponent 
decreasing as q shrinks is that when q is small, every 
node with a potentially high exponentially-distributed T 
value has few able competitors during its life span and 
therefore it is likely to acquire many links (more than for 
q = 1). At the same time, as q decreases, the power-law 
tail contains smaller and smaller fraction of all nodes and 
becomes less visible. This example further demonstrates 
flexibility of the studied model which is able to produce 
different kinds of behavior depending on the input pa- 
rameters. 
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It is easy to show that as long as Ri(t) values decay 
faster than 1/t, the growth of kj(t) is sublinear and the 
condensation phase observed in [16] is not possible de- 
spite T having an unlimited support. However, in the 
system numerically studied in Fig. [31 deviations from the 
scale free distribution of node degree appear when f3 is 
small. This happens when the characteristic lifetime of a 
node, is so long that the decay cannot compensate 
for the unlimited support of g(T). To get a qualitative 
estimate for the value of f3 when these deviations appear, 
we use the following argument. If the final degree distri- 
bution is a power law with exponent 7, we expect (A: max ) 
to grow as fVC-y-i) = y/i (h ere we use that the number 
of nodes equals t). When a node with a sufficiently high 
relevance appears, the system can undergo a temporary 
condensation phase where this node acquires a finite frac- 
tion of links during its lifetime. To avoid a deviation from 
the power law behavior, this lifetime must not be longer 
than (fc max ), hence j3 < l/Vt. As t goes to 00, j3 can be 
arbitrarily small and yet no deviations appear. This con- 
firms that in the thermodynamic limit, the condensation 
phase does not realize in our model. 

The key formula Q builds on the assumption that 
fluctuations of f2(t) are small enough, and the degree 
distribution results hold if the effective lifetime of nodes 
is long enough (a short-living node cannot acquire many 
links regardless of its total relevance). These two as- 
sumptions are in fact closely related: when the effective 
lifetime of nodes is long, then at any time step there are 
many nodes competing for the incoming link and the time 
fluctuations of Q(t) are hence small. To evaluate the ef- 
fective life time of node i, Ti, we use the participation 
number 



(8) 



When Ti ^ 1 for all nodes, ft(t) fluctuates little. Numer- 
ical simulations show that var(il) is indeed proportional 
to the effective life time for a wide range of decay func- 
tions R(t), confirming its relevance in the present con- 
text. In conclusion, our analytical results are valid when 
all the obtained conditions (Ri(t) decreasing faster than 
1/t, \Ri(t)\ < Ri(t), and n > 1), are fulfilled. 

To summarize, we studied a model of a growing net- 
work where heterogeneous fitness (relevance) values and 
aging of nodes (time decay) are combined. We showed 
that in contrast to models where these two effects are 
considered in isolation, here we obtain various realistic 
degree distributions for a wide range of input parame- 
ters. We analyzed real citation data and showed that 
they indeed support the hypothesis of coexisting node 
heterogeneity and time decay. Even when our model is 
more realistic than the preferential attachment alone, it 
neglects several effects which might be of importance in 
various systems: directed nature of the network, acceler- 



ating growth of the network, gradual fragmentation of the 
network into related yet independent fields, and others. 
Note that the very reason for the exponential tail of the 
total fitness value T, though it is crucial for the resulting 
degree distribution, is not discussed here at all — yet we 
have empirical support for it in our data. Also the case 
when the normalization fl(t) in Eq. ([2]) does not have 
a stationary value (because limt^oo Ri(t) > or g(T) 
decays slower than exponentially) is open. Finally, note 
that while we focused on the degree distribution here, 
there are other network characteristics — such as cluster- 
ing coefficient and degree correlations — that deserve fur- 
ther attention. 
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